Comparative study of different parameters for temporal decomposition based speech coding
نویسندگان
چکیده
Temporal decomposition (TD) is an e ective technique to compress the spectral information of speech through orthogonalization of the matrix of spectral parameters leading to an e cient rate reduction in speech coding applications. The performance of TD is function of the parameters used. Although \decomposition suitability" of a parameter set is typically de ned on the basis of \phonetic relevance" criterion, it can not be directly used in speech coding. Instead, quality evaluation of reconstructed speech is more appropriate. In this paper, we extend our earlier work in this area and attempt to assess several \popular" spectral parameter sets from the viewpoint of decomposition suitability in very low-rate speech coding using parametric, perceptually-based spectral, and energy distance measures.
منابع مشابه
Coding Speech at Very Low R and Temporal Deco
This paper presents a new method for speech coding at rates around 1.2 kbps based on STRAIGHT, a high quality speech analysis-synthesis method. For encoding spectral information, Modified Restricted Temporal Decomposition (MRTD) based vector quantization is used, where MRTD is a method of temporal decomposition for line spectral frequency parameters. Meanwhile, pitch and gain parameters are cod...
متن کاملVery low rate speech coding using temporal decomposition and waveform interpolation
In very low rate coding the aim is to accurately represent speech characteristics as efficiently as possible. High coding gains for the spectral features can be achieved through the use of temporal decomposition. Waveform interpolation coders accurately represent the excitation using characteristic waveforms (CWs) extracted at a constant rate. In this paper, the two approaches are combined into...
متن کاملPhoneme Classification Using Temporal Tracking of Speech Clusters in Spectro-temporal Domain
This article presents a new feature extraction technique based on the temporal tracking of clusters in spectro-temporal features space. In the proposed method, auditory cortical outputs were clustered. The attributes of speech clusters were extracted as secondary features. However, the shape and position of speech clusters change during the time. The clusters temporally tracked and temporal tra...
متن کاملHierarchical temporal decomposition: a novel approach to efficient compression of spectral characteristics of speech
We propose a new approach to Temporal Decomposition (TD) of characteristic parameters of speech for very low rate coding applications. The method models the articulatory dynamics employing a hierarchical error minimization algorithm which does not use Singular Value Decomposition. It is also much faster than conventional TD and could be implemented in realtime. High exibility is achieved with t...
متن کاملSpeech Enhancement using Adaptive Data-Based Dictionary Learning
In this paper, a speech enhancement method based on sparse representation of data frames has been presented. Speech enhancement is one of the most applicable areas in different signal processing fields. The objective of a speech enhancement system is improvement of either intelligibility or quality of the speech signals. This process is carried out using the speech signal processing techniques ...
متن کامل